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Parallel machine and manual literature searches on 
the subject of chemical evolution and the origin of life were 
compared on six characteristics: (1) precision, (2) recall, (3) 
novelty, (a) uniqueness, (5) time cost per citation, and (6) dollar 
cost per citation. The manual search outperformed the machine on 
precision, novelty, uniqueness, and dollar cost per citation although 
this was based on partial cost data for the manual search. There was 
little difference in recall between the two methods. For this subject 
area, "Chemical Abstracts'^, "International Aerospace Abstracts", and 
the Automatic Subject Citation Alert service were found to be the 
most effective sources for overall recall precision, novelty, and 
uniqueness. RECON and MEDLARS were found to be the most efficient in 
terms of times and costs. The study concludes that while the manual 
search had a slightly better overall performance, both modes are 
necessary for a comprehensive multi-disciplinary literature survey. 
(JG) 
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CliAPTER 1 
INTRODUCTION 



OBJECTIVE: 

The purpose of this study was to compare the results of 
parallel machine and manual literature searches in the preparation 
of an ongoing bibliography. This bibliography is comprehensive and 
rnultidisciplinary on the subject of Chemical Evolution and the Origin 
of Life. The specific objectives of this work were to determine: 

1. which source{s) is the most effective using the U3er 
criteria of recall, precision, novelty and uniqueness 

2. which source(s) is the most efficient using the 
Management criteria of time and cost 

5. possible v;ays to economize or improve future searches. 

VALUE OF THE STUDY: 

In spite of the increase in automated infonnation retrieval 
systems, little study has been done to determine hov/ effectively these 
systems can replace or aid the traditional methods of search. The vast 
majority of the studies done have been on the relative value of indexing 
p2^ocedu2X>s within one system* When systems have been compared, the objective 
of the study has been a very siriall synthetic seai^ch of a current awareness 
nature, focusing on one or two characteristics only. In these days of 
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tiglit money and rising costs, nianagement needs quantitative data about 
how best to use the tools and services available to them in real-life 
situations. It is especially important with ongoing projects, such 
as the bibliography under study, to know wliat is being done and what 
can be donp more effectively and or efficiently. An analysis of 
multiple characteristics of a real and ongoing, retrospective 
literature search has not been undertaken to date. It is felt that 
the Chemical Evolution and Origin of Life Bibliography provides an 
acceptable basis for such a study. 

Each year for the past thi^ee years, the Chemical Evolution 
Branch, Life Sciences, NASA-Ames Research Center has commissioned the 
updating of the basic bibliography, "Chemical Evolution and the Origin 
of Life/' compiled by Martha W, V/est and Dr. Cyril PonnampeiTuna ard 
published in. Space Life Sciences 2(l970) 22^-295. Tiiese supplements 
include all literature in the subject field published the previous 
year and found in several scientific disciplines - astronomy, biology, 
chemisiir'y, geology, etc. The circumstances of this literature search 
seemed ideal in size, scope, and subject array for a comparison of the 
various criteria mentioned earlier. In addition, the conditions of the 
literature search are duplicated in many special libraries all over the 
country - 

1. a library well-stocked vath the indexes and abstracting 
services geared to the mission of the parent agency 

2. availability of several commercial computerizc^d search soi'viccs 
jj. an in-house information retrieval ^yotcm 

a literatur*^^ Searcher :^^ith both library and subject baekgreund. 
With these aggregate conditions present, it is hoped that the results of the 
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current study can fill a void in the evaluation literature. Equally 

important, it is hoped that this study vail provide an objective basis 

for User and Mamgement decisions in economizing future bibliographies 

as well as aiding the Searcher in more proficient surveillance of the literature. 

REVIEW OP F(EIATED WORK: 

Studies of many characteristics of retrieval systems (computerized 
and manual) have been made but efforts have cone en-?: rated on xhe input 
or indexing stage. When attention has been focused on the search or output 
stage, the work has been mostly an evaluation of intiexing systems vri^thin 

the system itself - rarely as a comparison with another system, manual 

1 2 ^ 

or computerized. For example, Cleverdon, Lancaster, and Salton, 

m their evaluatiojis, arrive at complicated ratios of recall, precision, 

etc. for various indexing methods within a single infomation retrieval 

system but never do they ansv/er the (question, ^'does tliis system as a whole 

ft 

retrieve infonnation more completely, faster or less expensively than 
that system?" 

Within the manual search mode, Spencer compares the pi'eparation 
of a di\ig bibliography using Cherni<^al Abstracts and Irdex Medicus for 
the years 1956-196^^ vrith the results of another using Science Citation 
Index 19^^1 and , An equal len^h of earch time was used with each 

tool aixl the results v;ere compared in various areas. She foimd that 
each of the three indexes produced a high percentage of unique references 
but no appreciable differences in efficiency. But when the total search 
time v;as limited to very short time spans, Science Citation Index 
perfomed niueh more profitably than did Chemical Abstracts or Index 
Medicus . Sie made no attempt to generalize her conclusions. 
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Within the computerized mode, emphasis has been on current 
awarenoss sei'vices rather than retrospective searches. In 1971, Barker, 
Veal a;id Wyatt compared the relative performance of foui^ major 
Chemical Abstracts Sewice magr.etic tape data-bases in teras of 
relative currency and retrieval efficiency. Ihta-bases varied from 
"titles only^' to ^'titles plus ke^'vord phrases" to "titles plus 
abstracts." Fifty questions were used and profiles for each were 
mtched against each of the data-bases. Precit:ion and recall percentages 
were calculated foi^ each question. Results were as they expected ~ the 
recall increased as the volujne of searchable material per citation increased. 

V/lien the comparison is between manual and computerized searches, 

the emphasis, again, has been on current awareness tools or MEDLARS. H. P. 

Angstadt^ of the Sim Oil Company compared the output of compiiter searches 

of Chemical Titles and Chemical Abstracts Condensates against the product 

of several searches scanning the same material by their nonnal procedures. 

Twenty-eight single question profiles were pi^epared and used as the basis 

of the study. Only computerized results >vrere evaluated in tenns of recall 

and precision. A second set of results was calculated after a revision 

of most of the profiles. Once optimum profiles were developed, Angstadt 

found no appreciable differences in precision and recall values for the 

tv;o computerized searches. 

Chemical Titles v/as, also, the subject for a study done by LjTLch 
7 

and Smith at the University of Oxford in 1970. Tliis investigation compai^ed 
the Chemical Titles output on one carefully constructed profile over a 
period of a year with a manual search of the relevant literature. Results 
v;ere reported in numbers rather than percentages, A.^as of comparisons 
were very limited - only those items found by both computer and manual 
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searching received any degree of statistical manipulation. Relative 

speed of retrieval v/as the important factor in this study. 

Another study of autoimted current awareness services versus 

g 

manual selection was done in 1970 by R. H. Searle of the United Kingdom 
Atomic Energy Agency. He compared references retrieved from Chemical 
Titles and ASCA with material retrieved by information and scientific 
personnel using the primary literature. He found that, at present, 
the computer- based sewices using small, high relevance profiles could 
retrieve only about ^0% of those retrieved by himan selection. 

Uie most thorough comparison studies have been done \n.th the 

9 

printed Index Medicus and the computer-based MEDIARS. In 196?, Ohta 
compared the results of seven medical bibliographies prepared with the 
use of both tools. Her study indicated that, at present, neither method 
gave adequate output and results varied so greatly from bibliography 
to bibliography that no general conclusions could be reached. 

MEDLARS and Index Medicus v^re again the basis of a portion of 
a study done in 1968 on the literature of Ophthalmology. Both Virgo*^^ 
and Miller "^"^ reported aM interpreted data generated by the study. 
Different methods of calculating precision percentages led to dramatic 
differences in results. Still, both reached the same general conclusion 
that MEDIARS searches are not clearly superior to manual searches of 
Ipilex Medicus and that for extensive retrospective searches more than one 
secoridary source must be used. 
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DEFINITIONS OP TERMS AND ABBREVIATIONS 



Data-base .Infonnation stored by an infonnation 

retrieval system, in printed or machine- 
readable form* 

Demnd search Search conducted in response to a specific 

demand • 

Novelty The ability of a retrieval system to reveal 

relevant information to the User for the first 
time. 

Precision The ability of a retrieval system to reveal only 

information of interest to the User and to 
hold back unwanted infonnation. 

Recall. The ability of the retrieval system to uncover 

all relevant infonnation stored in the data^-base. 

Reference The bibliographic data retrieved by searching 

the eight data-bases of this study, used 
interchangeably vdth "item^' arri ^'citation". 

Relevant item*. Item, citation or reference useful to the User 

in relation to his infonnation need which 
prompted his request. 

Search Action taken from the receipt of an infonnation 

request to the retrieval of the final dociunent 
or its citation. 
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Search Component One of the eight infonnation systems searched 

as part of the basic literature sm^vey of this 
study. 

Search Strategy... The translation of a request into the language 

of the infonnation system used. 

Uniqueness The ability of a retrieval system to reveal 

relevant infonnation not i^evealed by another 
retrieval system. 

UseTi , The requester of information. 

Profile The words or phrases v^ich describe the 

interests of the user. 

Noise. • .Unwanted. or nonrelevant infonnation retrieved 

by an infonnation retrieval system* 



" METHODOLOGY; 

The basis of this study v^as a ready-made situation. The annual 
bibliography updating "commissioned" by NASA -Ames was chosen as the search 
question since it involved rather broad subject areas and had proven 
user-value; therefore, it filled the criteria of a real search situation, 
not a synthetic one. The study v/as confined to those references bearing 
a 1972 publication date. The experiment was extended through the first 
six months of 1975 to allow for time lag of data input. Calendar time 
span was not a factor of concern in this study. 

A conference between the User, the Searcher and a Relevancy Judge 
was held to set parameters for the search. A comprehensive list of 
relevant indexjLng tenns was compiled by the Searcher (Appendix A). This 
list v/as then manipulated to fit the vocabulary of each data -base searched. 
Eight services were chosen as infoiination systems most likely to yield 
relevant references. The four printed sources selected for manual searching 
provided abstracts as well as titles of indexed literature. The four 
computer-based services gave "titles only" and were selected essentially 
bcCtiuGc they were the ones available or Gubscribed to by the User. The 
eight iiiformation systems used in the preparation of the bibliography 
are listed below. Special characteristics of each as related to the 
conduct of the study are described briefly. 
Computerized Sources: 

1. ASCA (Automatic Subject Citation Alert). This is a 
weekly conmercial current awareness service published 
by Institute for Scientific Information. A >x?rsonal 
profile is used to search a machine readable data-base 
containing natural science research literature from 

ERIC 



over 2^00 source publications. Entry to the data- 
base may be made by author, subject or corporation. 
The profile submitted by the User (Appendix B) vas 
accepted by the Searcher, although a few changes 
were suggested to produce a higher precision ratio 
in the future. Because this was a retrospective 
search, the changes affected only a small portion 
of the ASCA output used for this study. 
BIOSIS Standaixi Profile /5/8l2 (Biochemical Evolution). 
This is a commercial current awareness service published 
monthly by Biosciences Information Service. A standaxxi 
profile is used to search a machine readable data-base 
containing bioscientific research literature collected 
from approximately 80OO source publications. This 
data^base is the complete index files of each issue 
of Biological Abstracts and Bioresearch Index . Entry 
is by author, subject, biosystem or a Cross index. 
Profile //812 in the broad field of biochemical evolution 
is predetermined by the publishers of this service and 
is designed to interest more than one scientist. This 
Searcher, therefore, had no control over the search 
tenns used. 

MEDIARS (Medical Literature Analysis and Retrieval System). 
This is an off-line search service of the National Library 
of Medicine (IJIW) . A profile (Appendix C) designed by the 
Searcher is used to search a machine-readable data-base 
containing medical literature collected from over 2^f00 journals. 
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Entry is by subject and author. Periodic updating 
was made by MPDLINE to allow for a six month time lag of 
data input, MEDLINE is an on-line search sei'vice of the NLM. 
Its data-base is limited to approximately 1000 of the most 
heavily used Journals indexed for MEDLARS for the last 
three years. 

4. RECOl^I (Remote Console). This is a real-time, on-line, 
time-sh/^red computerized infomation retrieval system 
operated by NASA. The data-base contains aerospace 
research literature collected from government, industrial 
and academic researchers. STAR and lAA together serve as 
announcement bulletins for data input into this system. 
Entry is by author, subject, and corporate author. The 
profile used for this study vras designed by the Searcher 
(Appendix D) . 
Manual Sources: 

' P A (Biological Abstracts ) . This printed abstracting and 
indexing service is published twice monthly by Biosciences 
Information Service. It indexes biological, medical and 
biochemical literature from almost 8000 source publications. 
Each issue has author, permuterm subject, biosystematic 
and Cross indexes. Each volume (two per year) has a cumulative 
index. The Searcher used cumulative indexes for those 
volumes available. Otherwise only the subject index. 
Biological Abstracts Subjects in Context (B.A.S.I.C. ) , of 
the individual, issues was used. Bioi^esearch Index , a 
companion service with BA v;as not one of the information 
systems included in the survey, but is considered in 

O 
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certain conclusions reached later. It is published 
monthly by Biosciences Infonmtion Service and provides 
approxirrkately 100,000 additional citations (not abstracts) 
annually of items not found in M. These include 
symposia, review journals and cooks, government reports, 
etc. The indexing system is the same as that for B^. 
The complete index files of BA. and Bioresearch Index 
are the data-base searched by the BIOSIS Standaixl Profile 
Sei"^ice. 

CA (Chemical Abstracts ) . This is a printed abstracting 
and indexing service published weekly by Chemical Abstracts 
Services and covers chemical and chemical engineering 
research literature from approximately 12,000 source 
publications. Each issue has a keyword subject > author, 
and patent index* Cumulative indexes are issued for each 
volume (two volumes each year) . Cumulative subject 
indexes v;ere never available at the right time so all 
searching was done ir the keyword indexes of individual 
issues, 

lAA (International Aerospace Abstracts ) . This printed 
abstractlTig and indexing service is published twice each 
month by the American Institute of Aeronautics and 
Astronautics t Inc. It covers published literature in the 
field of aeronautics and space science and teclinology. 
Each issue has five indexes classi ficd subject, pei\3onal 
author, contract number, report number and nccec^sion 
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nuiiiber. Cumulative indexes are published semiannually 
and annually. The cumulative subject indexes were 
used when available. Otherwise, the subject irdexes 
of individual issues wore used. The RECON data-base 
includes all items announced in this publication* 
h, STAR (Scientific and Teclmical Aerospace Reports) . 
This is a printed abstracting and indexing service 
issued semimonthly by NASA. It covers the report 
literature on the science and technology of space 
and aeronautics • Each issue has five indexes - classified 
subject, personal author, corporate author, contract 
number, and accession nimiber. Cumulative indexes are 
published semiannually and annually. Semiannual subject 
indexes were used for one voluine only. Subject indexes 
for individual issues v:ere used for the other two volumes 
searched. The RECON data-base includes all items announced 
in this publication. 
All of the sources v;ere international in scope. Whenever possible, the 
output of computorized sources v:as handled first in order to prevent bias 
fron having seen abstracts in the manual soui^ces* 

The machin.' literature search started at the end of February, 1973 
and v;as concluded in July, 1973* The complete 1972 output for ASCA and the 
BIOSIS Standard Profile service were immediately available and a I4EDIAP,S 
demand search v/as requested immediately along vath monthly updates. Tlie 
RECON search v/as delayed as long as possible so the necessity for several 
updates vrith high duplication of references would bo minimized. Manual 
searching was begun during 1.3 first week of April, 1973- 'Th^ entii^e 
1972 output of each information system (manual or computerized) was 
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searched at one time. Output after January 1, 1975 was searched for 
1972 references as it arrived at MASA-Amos Life Sciences Library. Searchiii^ 
was continued through June 50, 1975 output to allow a six-month time lag 
for data input. As noted previously, the end product of this study was 
an exhaustive bibliography so the amoimt of time devoted to each source 
was not limited. 

When the gross searching was completed, the User^s literature file 
was checked for duplication of references. This data was necessary for 
calculating novelty and recall percentages. 

When manually searching printed sources, only subject indexes 
were used. Since time was not limited and degrees of relevancy were not 
assigned, all subject terms were considered equal and were searched in 
alphabetical order. When entries under relevant tenns were found, abstract 
niunbers Mere jotted down and consulted after all index searching v/as 
completed. Only those items thouglit to be relevant or possibly relevant 
were then "retrieved". Very few nonrelevant abstracts were consulted 

STAR » lAA and BA since full or partial titles are given in their 
indexes. 

As items were retrieved either from computer printout or printed 
index, bibliographic data v/as transferred to ^1" x 6" cards lising the 
fonriat of the flnvl bibliography. The source infoiTnation was encoded 
on the rear of the card. For exajnple , an item found in the ASCA printout 
v;as marked "ASCA 6 Mar 75 P^^" or "CA 75:9?255y" if found In CA. These 
ca>\is were alpJiabetized daily and as duplication arose, source information 
was cunralated on the rear of a master card. Duplicate carxis were then 
removed to another file for other uses. 

A Search Recoixi was kept for each source giving volume and issue 
niuribers searched, subject headings used, and time expended. In the case 



of computer printouts with no voliune or other identifying numbers, 
printout dates v.-ere used. Tlic tijne-keoping procedure was altered as 
the study t^rogressed (see Chapter 2, Unit Time Characteristic). 

All references thought to be relevant or possibly relevant by 
virtue of title or abstract ivrere retrieved. Pull texts of as many 
items as possible were retrieved after the search was completed. The 
holdin^:s of the TIASA Libraries, San Jose State University Library 
and Stanford University Libraries were used. In some instances, 
Interlibi^ary loans were obtained from other libraries. If the item was 
in a foreign language, the Searcher used English translations or ' 
English abstracts by the author, if available. In some cases, the 
Searcher was able to translate enough to determine relevancy. If 
there v;as still a question of relevancy, the Relevancy Judge made 
the final decision. All items judged not relevant v/ere placed in a 
nom^elevant file so that the data could be available for this study. 



CHAPTER 2 
OBSERVATIONS AND RESULTS 



General: 

Tile literature search produced 2701 references from all 
sources, 2677 from the eight sources treated in this study* Of thp 
retrieved items, 280 were considered relevant and included in the 
bibliography update; 2j50 of the 280 were retrieved by the eight 
sources. As shown in Table I, 26 references considered relevant 
by the User were not retrieved by the search. These were supplied 
by the User's personal file. All but 7 were articles in three newly 
published books and most probably victims of time lag, (Contents of 
two of the books were retrieved by the July and August BIOSIS Standard 
Profiles.) Hopefully, these 19 items will be retrieved in the next 
aimual update. The 7 remaining viere in the geological field. Failure 
to retrieve them could have been due to inadequate capabilities of the 
Searcher in that subject area or lack of coverage by any of the data^ 
bases searched. 

Table I. Analysis of I972 Published References 



Total References Retrieved by all sources = 2727 

/ — ~~r : 



Retrieved from 8 sources of 
study 2677 



Retrieved from other 
sources = 24 



Total r e 1 o vant 



230 



Supplied by user = 26 



Unknovjn to user = 15^ 



Total relevant 



2h 



^1 



Total relevant 26] 



Unknovjn to user - 20 



^■•eferences retrieved from current journals; bibliographies 
of reiev.^nt .rbicles; publisher's catalogs, etc. 
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Por the purposes of this study, the assumption was made that all 

rolovatit roferoncos found by the Searcher plus the 2( items in the 

User's file but not found by the Searcher constituted the entire 

secjnent of I972 literature in the field of Chernical Evolution and the 

2 12 

Ox^irrin of Life (280 references). Foimilas advanced by F. W. Lancaster ' 
of the University of Illinois Graduate School of Libr'ary Science v/ere 
used to calculate percentages of recall, precision, novelty, uniqueness 
and tiine cost. 

RTXALL AIID PRECISION CJLAi^CTERISTICS : 

'nio recall perfomance of a system is ex:prvs::ed quantitatively 
as the ratio of the number of relevant references retrieved by that 
system to the total number of relevant references in the literature. 
Tlie precision perfomance of a system is expressed quantitatively as 
the ratio of the nimiber of relevant references retrieved by that system 
to the total number of references, relevant and nonrelevant, retrieved 
by that system. Table II tabulates recall and precision values for 
oacli of the eig}:it sources (or search components), which are grouped 
aocoixiiri^ to the search mode - manual or computerized. For example: 

ASCA Precision ^ ^ niunber of relevant references retrieved x 100 I09 x 100 - 7?^ 

number of references retrieved I518 

ASCA Recall % - number of relevant refere n ces retrieved x 100 109 x 100 r.- y^^S 

total number of relevant references 28O 
in literature 
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Table II. Recall and Precision Characteristics for Search Components 



Component or Souroc- 


Retrieved 


Relevant 


Precision 


Recall 


ASCA 


1518 


109 


1% 




RECON 


210 


m 






BIOSIS Standard PixDf ile 


426 


31 


1% 


11^ 


MEDLARS 


258 


30 


y% 


10^ 


BA 


2h 


20 




1% 


CA 


137 


110 




1B% 


lAA 


121 


117 


91% 




STAR 


5 


5 


100% 


1% 



Column 2 of Table II shows a range in the number of retrieved 
references from ASCA's high of 1518 to STAKES low of 3. In past years 
STAR has yielded decidely more citations. Since the U. S. Government 
has curtailed niany of its research programs, the report literature 
indexed by STAR has decreased in several areas. Tlie increased niunber 
of items retrieved by lAA reflects the switch to jouiTial litei^ature. 

Only ASCA and BIOSIS Standard Profile had recall values wliich 
v;ere greater tb^n their precision percentages. The two sewices produced 
over ^0% of the total referencer^ retrieved by the eight sources. This 
has resulted in very low precision percentages of 7^ each. ASCA's 
extraordinarily large yield of references is due in i^art to the 
sopliisti cation of the service. I'his tool can search foi"^ relevant 
literature by specific word, phrase or stem, by autlior, corporation or 
source, or by cited work or author. The latter means the oystem can 
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search for any item wliich cites a particular reference or the work of 
a particular author. But a badly designed profile can result in the 
retrieval of a lot of '^noise." For example, the profile for this search 
(Appendix B) has directed the computer to retrieve work by Miller 
as well as items by other authors who cite Miller . It also directs 
the computer to do the same for S. L. Miller, a scientist working 
in the field of Origin of Life. The computer, therefore, retrieves 
I'/orks by and citing Millers from many different fields* Purthe:^V 
the profile also directs a search for references having the word 
Venus. This yields Venus Fly-Trap literature as well as that of 
the planet Venus. But a better designed profile couJd result in 
prohibitive costs. In this service, the specific word, stem, etc. and 
how it is used in the profile determines the charge. It may be more 
efficient in the long run to wade through the nonrelevant matter picked 
up by a more general, less costly tern. 

The equally low precision percentage for the BIOSIS Standard 
Profile v/as expected since it is a service designed to cover the 
larger field of biochemical evolution. Its format gave it an advantage 
v/hich allov;ed for optimum retrieval by the Searcher. All indexing 
terms assigned a reference are listed with the citation. The Searcher, 
therefore, did not have to rely on title alone as \^/ith other computerized 
output. 

Tl\e abstracting character of the manually searched sources was 
reflected in the smaller total number of items retrieved from those 
sources. Ch*?nces for retrieval, of only relevant articles was greatly 
increased. And the resultant precision values are very high. As was 
expected, CA had substantial precision and recall values when compared to 
the other sources. Tliis could be a result of its broad coverage - the 
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material was there to be found, Tlie biggest surprise to the Searcher 
was the excellent perfonnance of lAA , It had the highest recall 
value and the second highest precision value amon^ all the sources. 
Since lAA and STAR are printed announcements of the RECON data-base 
input, the 12 percentage points difference leads the Searcher to 
suspect that a better designed RECON search strategy is possible. 
The RECON search should have retrieved, at least, the 11? items that 
the lAA retrieved. 

Of the automated systems, thn NASA RECON was expected to 
perform much better than it did. Since the mission of this information 
retrieval system is to serve all NASA interest groups, it should optimumly 
include all references of interest to this literature search, and yet, 
its recall value was readily surpassed by that of the commercial service, 
ASCA, Again, all evidence seems to point to Searcher inadequacies. 

Because of its limited subject coverage and the broad subject 

tenns which must be used in this search, MEDLARS was expected to yield 

few relevant references and a good deal of "npise^' with a resultant 

low recall and precision percentages. It did just that. Because the 

subject area of this literature search is so much broader tlian the 

MEDLARS data-base coverage, the results are not adequate for a valid 

comparison v/ith the results of studies done by Virgo, "^^ Miller^^ and 
2 

Lancaster. Table III shov;s the wide discrepancies between those studies 
v^iich were limited to medical questions and the present multidisciplinary 
study . 
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Table III > Comparative Results of MEDLARS Perfonnance in Various Studies 



Characteristics Virgo Miller Lancaster Present Study- 

Recall Percentage 50.4 - 48 57*7 10 

Precision Percentage 54*3 l8 50.4 15 



UNIQUEINESS NOVELTY CHARACTERISTICS: 

Tlie uniqueness and novelty characteristics are shown in Table IV. 
Uniqueness is the percentage of relevant references v;hich were found 
exclusively by a source and is expressed as the ratio of the number of 
unique items retrieved by a system to the total relevant references 
found by all sources or systems. Novelty is defined as the percentage 
of relevant items brought to the User*s attention for the first time 
by this literature search a:nd is expressed as the ratio of the number 
of novel items retrieved from a source, or system, to the total number 
of relevant items retrieved by all sources or systems. 

The novelty values range from CA's high of 2^1% to a low of 0/(% 
for STAR > Again, because of its very broad coverage, CA has achieved 
the highest novelty value. Hov/ever, the ASCA yield with its 62 {22%) new 
relevant references may be of considerably -more value to the User because 
of its currency factor. Although there is high duplication among the 
novel items, much less effoii/ and time is required to retrieve them 
from ASCA. 



ERIC 



21 

Tablo IV. Uniqucnoss and Novelty Characteristics for Search Components 



Search Component Unique to 

Source 


Uniqueness 


Novel to 
User 


Novelty 


ASCA 


18 




62 


22^ 


RECON 


1 




26 


9% 


BIOSIS Std. Prof* 


7 




19 


6.8% 


MEDIARS 


5 


1.1% 


20 


7-1% 


M 




l.>i% 


lif 


5% 


CA 


28 


10% 


75 


27% 


lAA ' 


3 


1.1% 




15% 


STAR 


0 


0 


2 


0.7% 



Perhaps the most telling measure of the value of a source to a 
literature search of this nature is the uniqueness characteristic - how 
many items did that source alone contribute to the total? A quick glance 
at the second column figures shov:s that STAR has no value to the completed 
literature search* The three relevant items retrieved from this source 
v;ere duplicated in all cases by the RECON search. The extremely low 
nombc^r of rofei-enoes unique to RECON is due mainly to the duplication 
of IA/\ and STAR coverage. Theoretically, lAA should, also, have 0% 
uniqueness since it is merely an announcement bulletin for published 
literature being indexed into the RECON data-base. Its comparatively high 
15/0 uniqueness value is again, strongly indicative of weak search strategy 
vrlth the RECON search. The generally low uniqueness values reflect the 
v/idesprcad duplication of the data-bases of all sources. 

O 
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UNIT SEARCH TIME CHAMCTERISTIC : 

At the beginning of the study, time charts were kept for all 
searching h^xt it soon became evident that this would not be a fair 
basis for comparison* It was possible to record accurately the time 
spenc sear'cnxn^ inueAtjy axju oransx"ei*iuiig biuliugi-aphic infomation 
to the Searcher's cards for those sources searched manually, Tliis was 
not possible for all of the computer searches. Since this was a real 
search situation, computer printouts were scanned as received. Often 
the ASCA printouts were so short as to take just a couple of minutes • 
fractions of a irinute thus become crucial to the total tirr*e picture. 
Since this was also true of the monthly BIOSIS Standard Profile 
service and MEDLINE updates , the Searcher decided to calculate an 
average scanning time per citation and hopefully reduce the margin of 
error. Tliis was done by recording precisely the time taken to scan a 
specific number of references from a source and computing an average 
scanning time for eacii ciLation for each source, then multiplying by the 
total nuj|(B%|_of references retrieved by that system. The times differed 
from system to system because of variation in format and data given. 

Time for recording bibliographic data was treated in a like manner 
and an average recording time per citation was calculated. This was 
multiplied by the number of references recorded from that source - both 
relevant and subsequently nonrelevant. This unit time also varied 
among sources due to ease of converting from one format to that used for 
this study. These tv?o factors v/ere then suimied to arrive at a total 
search time for that system. For example: 
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A3CA Search. 

35 citations seamed in 5 minutes = 1/7 min per citation 

i/7 X 1518 references retrieved - 217 minutes of scanning time 

IJh relevant or (?) relevant citations recorded 

X 

2 minutes recording time per reference = 268 minutes recording tijme 

217 minutes scanning time 
485 minutes search time 

The total search time for each source then was the sum of the 
time sptjut scanning the published index or the computer printout for 
relevant citations and the time necessaiT to record the bibliographic 
data of those citations. Results of these measurements are shown in 
Table IV along with an extended unit of measurement for comparison. 

To know that 485 minutes was spent on the ASCA search and 
only llh minutes on the MEDLARS search does not really serve as a 
valid basis for comparison since the ASCA search yielded IO9 relevant 
citations and the MEDLARS search only JO. In order to arrive at a common 
denominator for analyzing each source's time characteristic, a unit 
search time v;as computed by dividing the total search time for a source 
by the number of relevant references found in that source. For example: 

'f85 minutes search time for ASCA . . / ^ . .j. 4.. 

— ^ ^ 4.45 minutes/relevant citati 

109 relevant references found in ASCA 

Each relevant citation found in the ASCA search cost the Searcher 
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4.43 minutes of her time. Results for all sources are displayed in 
Table V. . 

Table V. Time Characteristics for Search Components 



Component 


Total Search 
Time - minutes 


Relevant 
Citations 


Unit Time 
Cost 


AoUA. 


)i Oct 


109 


kJ\3 min/cit 


Kh(X)N 


260 


0 ). 
84 


3.1 


nin/cit 


BIOSIS Std. Prof. 




31 


6.^2 min/cit 


MEDLARS 


11^+ 


30 


3.8 


min/cit 


M 




20 


30.7 


min/cit 


CA 


1878 


110 


17.1 


min/cit 


lAA 




117 


4.3 


min/cit 


STAR 


258 


5 


86 


min/cit 



The low unit time cost of the computer searches were certainly 
no siu"prise> but the e>rtremely competitive value for lAA was most 
unexpected. VThen compared with CA which yielded only 7 less relevant 
references, the difference becomes even more dramatic. Several factors 
may account for this. First of all, time was saved since semi-annual 
indexes v;ere available for lAA but not for CA. Second, the classified 
arrangement of the lAA subject index greatly reduced search time. Third, 
the use of full titles in the lAA subject index reduced the number of 
abstracts to be consulted for relevancy. For example, all references 
indexed under a particular term by lAA are listed by full title under that 
heading. Tlie term "abiogenesis" in volume 13, niunber 6 listed 16 items. 

O 
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The titles indicated that all l6 were higlily relevant. At least 6 
terns nn.i3t be consulted in the CA keyivord subject index to retrieve 
abstract niunbers of these same references. 

Ific tlu^ec citations retrieved from STAR were extremely costly 
in time. Not only did the Searcher spend 86 minutes for each of the 
citations, butt \4ien the uniqueness percentages are analyzed, it is noted 
that all three are duplicated. 

Each relevant citation from BA cost almost t\sdce that for the ones 
retrieved from CA. The format of the permuted B.A.S.I.C. index is probably 
the biggest factor in slowing the search of BA. 

The CA total search time is maximized since no cumulative subject 
indexes v;ere available at the proper time. The cumulative issue did 
not arrive in the library until the individual issues for that volume 
had been searched. Time did not allow for a duplicate search when the 
cumulative issues did arrive. As a consequence, it was necessary to search 
78 individual subject indexes. 

V/ithin the computer search mode, the BIOSIS Standard Profile 
search v;as the most costly in time. In spite of a format easily read 
and transcribed, the many extraneous references greatly increased the 
total scanning time for a source already low in precision. The RECON 
search achieved the best results vrith a time cost of J.l minutes per 
relevant citation, follov;ed closely by the MEDLARS search vath a time 
cost of 3«8 minutes per relevant citation. The Searcher believes 
this to be the result of the greater control exercised on these tv;o 
searches and the subsequent smaller number of retrievals. Time was 
not spent v/ith the type of "noise" that vjas found in the ASCA ai:d BIOSIS 
Standaixi Profile searches. 

ERLC 
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DOUAR COST CHARACTERISTICS: 

Mo common cienominator for costs could be ascertained for the 
eight sources. With the data available, absolute costs in dollars 
could be detennined oixly for the two commercial computerized services, 
ASCA and BIOSIS Standard Profile. Because of the great variables involved, 
no attempt to compare machine search costs va.th manual costs vd.ll be 
made. However, there are some natural subgroupings which can be analyzed 
to somo extent • Of the computerized sources, two are commercial current 
awarenoi3G ventures subscribed to by the User and administered by the 
library. Table VI sumrrarizes the cost characteristics of these two 
seivices. Tlie vast differences in the unit costs reflect the degree of 
personal tailoring i.n the profiles. 

Table VI. Cost Characteristics for Commercial Computer Search Sei^vices 

Commercial Sen/ice Animal l8 months Librarian Unit Cost 

Cost Cost Seai'ch Cost per citation 

@$5/hour (relevant) 

ASCA $-j02 $588 $hoJiO 15.76/cit 

BIOSIS Std. Prof. $50 $ 75 I16.35 $ .52/cit 



ThQ two remaining computerized sources are subsidized by the U. S. 
GoveiTment and at the ti/ne of this study were free to the Usvr. Tl\e 
costs involved were only those of the Searcher* s time. Table VII 
summarizes the pertinent data. 
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Table XHLl. Cost CharacteriGticG for SubGidizod Computer Searches 

Subsidized Sewicc Librarian Search Unit Cost Per 

Cost Relevant Citation 

RECON $21,50 $ .26/cit 

MEDIJIRS $ 9.50 $ .52/cit 



Within the manual search mode, the great mimbor of variables 
make aiiy attempt to calculate a true unit cost impossible. No data on 
costs of acquiring, catalogin^^, storing or use-amortization of the 
printed indexes v^ere available for this study. For strictly infoiination 
purposes, yearly subscription rates and other costs factors are shown 
in Table VIII. 



Table ^11. Costs Characteristics for Manual Search Components 



Service 


Armual Subscription 
Kate 


18 months 

COGt 


Librarian 
Search Cost 


(Unit Cost) 
(Per Citation)* 


m. 


$1000 


$1300 




($2.56)* 


CA 


$2''v00 


.$5600 


$156. •./"^ 


(Si. 42)* 


lAA 


$ 2MO 


1 'j>60 


.$ 41.90 




ST/\R 


$ 118 
(CXmilativo 
Indexes E:ctra) 


$ 177 


$ 21.50 


($7.17)* 



*n;cse values are based on the Librarian searcli time only and do 
not take any other costs, direct or indirect? into consideration. 
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It is interesting^ to note that even vathoiit considering the subscription 
price, the costs of the manual searches generally were higher than those 
in the eornfxiter mode. 

MA^Jl^AL AIJD COMFJTER COI Ji^CTIVE RESin.TS: 

Data on the oolleetivo efforts, of the four sources in each 
mode - manual and computer - are displayed in Table IX. As was expected 
the i:iach.ine searches retrieved a much larger number of references, yet 
the niunber of relevant items retrieved was sliglitly less tlian that 
z^etrioved by manual means. One hundred thirty three relevant refei^ences 
V\ere retrieved J.n common. 

Table IX. I^lanual Vs. Computerized Searching 
Simuaary of Search Data and Characteristics 



Manual 



Machine 



Total citations retrieved 
IJumber of relevant citations 
Fvocislon 
Recall 

Citafciona not-; to Jsor 
Nov'lty 

Items unii'iue to search mode 

Uniqueness 

Total search time 

Tii:;e cost per citation 

Total Cost 

Dollar Cost per citation 



285 
186 

66% 
115 

53 
19% 

"i23h minutes 
17. 5 niin/cit 

$271.15 
$l.^l6/cit 



2582 
177 

lAf. (16.75^0 
101 

16% 

1055 minutes 
^;.95 min/cit 

$750.75 
$^l.2^l/cit 



•^Searcher ^s tim^:^ cost only 



■x-^ Based on Searcher* s tiine cost only 



• The pixjoinion Tiijairea were calculabc3d by siuiiniing the total 

iiiunbe!^ of rolovant reforoneos retrieved by the collective efforts of 

1 5 

tlie comporiCr^ts of each mode. Virgo argues that this may not produce a 
realiirtic figai'c since unusually liigh or low perfonnance by one or more 
of f}>n nnrnponents of the iuuuu can yield 1^3 suits atypical of the mode 
as a ;%aolo. For example, ^he gap between the lowest and the higliest 
number of items within the marp;al mode varies by I33 items, yet the 
highest ai:id lowest precision percentages vary by only 20 percentage 
point:; (Sec Tabic II). To allow for the unnatiu^ally high and 
lov; perfomances of STAR and ASCA , precision percentages viero also 
calculated as Virgo su^Kcsts, by averaging the precisions i>3rcentages 
of the components of each mode rather than totaling raw niambers . These 
adjusted figui^es are set off in parentheses. 

The manual search outperfoms the machine search in four of the 
six cliaracteristics displayed. (We can not be sure about the dollar 
cost per citation since v;e are showing only partial costs foi^ the 
maniiai searchr) 'Ulie recall factor varied little. Since the computer 
datcl-base coverage is fairly well duplicated by that ol' the printed 
indexes us?.'d for m.anual searching, this closeness v;as cx]jected. Also 
e>:peeted v;as the superior time perfor-mance of the computer search, 

Ttio vridely- spread precision percentages was not entirely unexpected 

v;lion it is ivomembered that the manual figures are based on the niunber of 

12 

relevant documents found or retrieved from abstracts. Lancaster 
feels tiiat a more accurate comparison wuld have resulted if all abstracts 
scarmod v;ere coiuitod as rf^trievals. Tlais v^uld have affected CA retrieval 
figiir';:"; in ryirti 'ular. Its koy;'roTxi subject ind. x has no standai^dization 
of tenns and broader subject headings must be consulted as v/ell as more 
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nari^ov; tenns. This mQans that many abstracts may be scanned under the 
broader terns in oixier to yield a few relevant citations. It is the 
belief of this Searcher thitt the into1 leotna'l effoT^". expended in a inanual 
^-^earch parallels the mechanical energy expended in a machine search. If 
the computer were designed to exercise this same selectivity against i\ 
data-base comprised of full text or abstracts, the increased costs at 
this time would probably negate the increased perfomance. 

The novelty percentage spread was only 5 points - not a dramatic 
difference, but of some consequence when an exhaustive search is 
desired. The important factor hero is that Kl^ of the references retrieved 
through the manual m^^de were new to the Ufjer. 

Value to the overall search can be pinpointed in the uniqueness 
factor. Tlie manual search contributed 53 relevant items out of a total 
of 28r ;hich v;ere not duplicated by any other source. This means almost 
1/5 ,th of the final product v;ould be missing if the Searcher had relied 
on mechanical means only. Conversely, the machine search contributed 
'hh or almost 1/6 th of the search. From this point of vievj, the uniqueness 
results of the two modes seem somewhat more substantial. 
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CliAPTER 5 
SUT^IARY AIJD CONCLUSIONS 



Table X vas designed to simnarize the results of the study 
by iieareh component relationship rather than number values. Components 
havt' been ranl-ied on a basis of 1 to 8, ^^dth 1 being the most desirable 
perfoiTnance and 8, the least desirable. Upper positions on the chart show the 
most valuable components. A dotted line has been used to separate "good" 
perfoiTiiance from ^'bad'^ v/itli the manual search components underlined. A 
quick glance at the concentration of underlined components indicates the 
relative value of the ciunulative effort of manual versus machine modes. 

Table X« RanV:od Summary of Ctiaracteristics for Search Components 





Prooision 


Recall 


Novelty 


Uniqueness 


Time Cost 
Pel' Citation 


Dollar Cost 
Per Citation 


1 

1. 


STAR 


lAA 


CA 


CA 


RECON 


RI'XON 


2 


lAA 


CA 


ASCA 


ASCA 


I^LAl^S 


MEDIARS 


"A 


BA 


ASCA 


lAA 


BIOSIS 


lAA 


BIOSIS 


\ , 
il 


CA 


RJ2C0II 


RECOK 


BA 


ASCA 


ASCA 






BIOSIS 


MEDLARS 


lAA & 
I'EDIARS 


BIOSIS 




h 


MI'DIARS 


MEDLARS 


BIOSIS 




CA 




/ 


ASCA 




M 


RECON 


RA 




r. 

o 


niosis 


STAR 


STAR 


STAR 


STAR 


_ x-x-x- 



•x-x-xinsuf ficicnt data available for rankings 
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To arri v^e at a mmierical ranking of the eight sei-vices vised, 
one to eigivt points vierc assigned on the reverse basis as position 
v/as av/arcied above; 8 pointo for the most desirable , 1 point for the 
least desirable. Dollai* cost infonnation was excluded since it 
v;aG incomplete. For example, CA won l6 points for two first positions, 
7 points for a secorxd, 5 ^or a fourth and 5 for a sixth. This made a total 
of J)l points. UT:ien the points for each service were totaled, the 
I'ol lovfiiii:- munerical rarikings v/ere established: 

Points 
31 
31 

27 
2h 

21 

18 

17 
12 



Tnis study has roinforcr-d the Searcher's belief that there are 
so many variatloG in every search, absolute and accurate qu-intif ication 
is irnposciblo. Search strategy, data^base coverage, indexing policy 
and vocabulary, and index format differ with eveiy system. TiiovofovQ ^ 
v;e are unable to state that any of the results are absolute or tliat ^ny 
of these results vrill hold "Giiio for a l\iture search. We can only state 
that for the purpos'^s of l\itur^'^ li.teratiu:"e surveys on the subject of 
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Sorvico_ 
lAA 
CA 
ASCA 
RECOII 
MIIDLARS 

BIOSIS Std. Prof. 
BA 

STAR 



Chemical Evolution and Origin of Life, the results of this study 
indicate: 

1. that the most effective sources for overall rocall, 
precision, novelty and uniqueness are Chemical Abstracts , 
International Aerospace Abstracts and the Automatic Subject 
Citation Alert (ASCA) seivlce. 

2. that the most efficient sources in tems of time and costs 

arc the two subsidized computerized sources, RECON and MEDLARS 
5. that a more proficient search could result by 

a. dlscor^tlnuing the STAR search < 

b. designing a more precise profile for the ASCA search. 

c. keeping MEDLARS sewice only so long as it is free. 

d. keeping BIOSIS Standard Profile service as long as 
the present low cost holds. Otherwise this service 
should be discontinued and a manual search of 
BIORESEARCII INDEX in conjunction with Bk be 
initiated to replace its uniqueness value . 

4. that there is a sore need for more cost infomiation before 
valid conclusions can bo reached concerning manual search 
efficiency . 

5. that a better designed search strategy for RECON is needed, 
one proficient enough to obviate the need for lAA . 

Although the manual search had a slightly better overall perf onnance , 
it is concluded that neither mode is adequate for the production of a 
comprehensive bibliography of a multi-disciplinary nature. For an 
exi"iau:;tive sun/ey of the literature, the errorta of both are necessary. 
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APPEIJDIX A 

Svibjoct Headings Used To Retrieve Chemical 
Evolution and Origin of Life Literature 

Abiogenesis 
Blochomicvil Evolution 
Biogenesis 
Biogeochemistry 
Carbonaceous Chondrites 
Cheinical Evolution 
Exobiology 

Extraterrestial Life 
Origin of Life 

Primitive Atmosphere, Earth, Environment or Ocean 



Group A 



used in conjunction with 



Group B 



Amino Acid(s) 
Ammonia 

Dluc'-grccn Algae 
Carbohydrates 
Fatty Acids 
Fomaldehyde 
Hydrogen Cyanide 
Methane 

Nicotinic Acid 

Isfucleosides 

fJuclcotideG 



Organic Compounds 

Organi c Matter 

Peptides 

Pho spho ry 1 ation 

Photosynthesis 

Porphyrins 

Proteins 

Proteinoids 

Purines 

Pyrimidines 



Apollo 

Extraterrestrial Environment 

Interstellar Molecules 

Jupiter Atmosphere 

Lunar Enviix^ninent 

Mars Atmosphere 

Prebiotic • 

Precambrian 

Preccilular 

Primordial 

Venus Atmosphere 
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AUrOMATtC SUBJECT CITATION ALERT 

a service of the INSTITUTE FOR SCIENTIFIC INFORMATION 



OR, KEITH KVENVOLDEN 
NASA - AMES RfcSEiARCH 



CENTER 



CHEMICAL EV0LUTl0N,239-9 
MOFFETT FlELDt CALIFORNIA 



94C35 



M1440 ACCOUNT NO • 



Ml 440 



31 JUL 73 



ASCA PPOFILE REVISION FORM 



TrJSs^KM^pAU'^^h '^i^F.^ri^^^^^^ CARBON. CAOSS OUT DELETES. ADO 
fLRMS ON LAST SHc-T USING NEXT HIGHER "* -v-^i-^. muu 



NUMBER. RETURN UhIGINAL. 



THIS ASCA PROFILE HAS BEEN IN EFFECT 7 MONTHS 



TERM 
NO. 



NAME t INI TI ALS 
OR OTHER TERM 



AI 



1 OPAR IN 
I AOPAR IN 

2 MILLER 3L 
2AMILLCiR 

3 CPO J 

4 FOX SW 
4AF0X 

5 CALVIN M 
SACALVIN 

6 PONNAMPE^'^UMA C 
6AP0NNAMOcf-:UMA 

22 Gf-GcL L£ 
22A0RG£L 
26 SAGAN C 
26ASAGAN 
31 JUPITE-R 

33 WALO G 

34 ntkKNE!^ L V 

35 AOFLSON PH 
3B ur<EY H/ 
3aAURt:Y 

39 ANDHRS E 
3 9AAN0S::f<S 

40 KVfNV'JLDCN K 
4CAKVFNV JLOEN 

41 EGLINTCN G 
41AEGLlNrCN 

42 SCHC^F 

4PASCH0PF 

43 CLCUO PE 
43ACLCUD 

44 NAG Y n 
44ANAGY 

45 PPF.yiO/ 

46 CHflMICAL EVJL/ 
46A CHEMICAL -EVOL/ 

AH 10/ 

ORGAN IC GEOCHclM/ / 
OmG AN I C -G u OCHr: M/ 
M^TSrG'^MT/ 
t XTRAT^R^^ST-51 AL 
EXTOA T=^i<o>l STP I AL 

extra-Tc^n::st^ial 
prccamhri an 

carhonactous chonopi t/ 



CITED PUBLICATION 
OR (CLASS OF TcRM) 

( CI TED AUTHOR) 
<CITEO AUTHOR) 
( CITED AUTHOR ) 
( CI TED AUTHOR ) 
(CITED AUTHORS ) 
( CITED AUTHOR > 
(CI TfiD AUTHOR ) 
( CI TED AUTHO:^ ) 
( CI TED AUTHOR ) 
(CITED AUTHOR) 
(CI TED AUTHnr*3 ) 
(CITED AUTHOR ) 
( CITED AUTHOR ) 
(CI TED AUT^iOR ) 
(CITED AUTHOR? ) 
(CITED AUTHOR) 
( WORD ) 
ANN NY ACAD SC I 
P NAS US 
P NAT ACAD US 
( CI TLD 
(CITED 
(CiTrD 
( CITED 
(CI TED 
(CITED 
(CITED 
(CITED 
(CITED 
(CITED 
( CITED 
( CITED 
(CI TED 
K CITED 



VOL LOW HIGH YR 
(TYPOPAGE PAGE 



(TP I) DA 

69 362 
53 1215 
55 1365 




AUTHQr^ ) 








AUTHOR ) 








AUTHOR ) 








AUTHOR ) 








AUTHOR ) 








AUTHOR ) 








AUTH JR ) 








AUTHl^R ) 








AUTHIJhi ) 








AUTHOR) 








AUTHOR ) 








AUTH^DR ) 








AUTHOR > 








AUTHOR ) 








( WORD) 


(TP 


1 ) 


DA 


(WORD ) 


(TP 


1 ) 


OA 


(WORD ) 


(TP 


1 ) 


DA 


( WORD) 


(TP 


1 ) 


DA 


( W3C;o) 


( TP 


U 


DA 


(WOKD) 


(TP 


1 ) 


OA 


(WORD) 


(TP 


1) 


DA 


(WORD) 


( TP 


1 ) 


DA 


(WORD) 


(TP 


I > 


OA 


( WO'^0) 


( TP 


t ) 


DA 


( WORD ) 


(TP 


I ) 


OA 


(WORD) 


(TP 


1 ) 


DA 


k2 









57 
65 
66 



S 

9 
9 
9 
9 

11 
9 

13 

9 

7 
3 
3 
3 
16 

9 

9 

9 

9 

9 

9 

7 
10 

.7 ■ 

7 
7 
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AUTOMATIC SUBJECT CITATION ALERT 

a sofvico Of the INSTITUTE FOR SCIENTIFIC INFORMATION 



M1440 ACCOUNT NO < 



31 JUL 73 ASCA PROFILE PeVIStON FORM 

Tt-IAf^ ON Vf£;5TlCAL LINE. iNSi^RT Q^f^i^OA• CROSS OUT OELHT^S* ADO 
TERMS CN LAST SH^eT USING NEXT HIGHER NUMBER • RETUkN ORIGINAL* 

THIS A3CA PROFILE HAS QEEN IN EFFECT 7 MONTHS 



TERM NAME* INIT lALS CITED PUBLICATION VOL LOW HIGH YR 

NOt OR OTHER TERM OR ( CLASS OF' TE RM ) (TYPE)PAGc PACE $ 



52A CARBONACEOUS 


--CHONDPIT/ 


( WvlRO ) 


(TP 


I) 


OA 








53. INTERSTELLAR 


VOLECUL/ 


(WORD) 


(TP 


1 ) 


DA 






10 


53A INTERSTELLAR 


-MOLECUL/ 


(WORD) 


(TP 


I ) 


DA 








54 VLNUS 




(WORD) 


(TP 


1 ) 


DA 






B 


66 MARS 




( WORD) 


(TP 


1) 


OA 






7 


56 tinrNAL JO 


PHYSICAL BASIS LIFE 








S\ 


4 


57 ViALOANc JHS 


ORIGIN LIFE 






242 


251 


67 


4 


57A*Jcr<NAL JO 


ORIGIN LIF 


E 


ED 




242 


251 


67 




5 6 HOLLAND HD 


PFTRCGRAPHIC STUD lES 




44 7 


477 


62 




59 RU^icY 


CRUST EARTH 






631 


650 


65 


4 


59AP0L0ERVAART A 


CRUST EARTH 


ED 




6 31 


650 


55 




60 KAPLAN !R 


(CITED 


AUTHOR ) 












9 


6CAKAPLAN 


(CITEO 


AUTHOR) 














61 JOVIAN 




(WORD) 


( TP 


I ) 


DA 






7 


62 MARTIAN 




(WORD) 


(TP 


1 ) 


DA 






7 


63 CCMcT/ 




( WOr^O) 


(TP 


I) 


DA 






7 


64 AST-7-tCIO 




(WORD) 


(TP 


I ) 


OA 






7 


65 CARBON 




(WORD) 


(1 


Al ) 


HO 






24 


66 NITROGEN 




(WORD) 


( I 


Al) 


HQ 






14 


67 ISITOP/ 




(WOROI 


(I 


A2) 


HD 






15 


66 STEVENSON F J 


(CITEO 


AUTHO?^) 












9 


6eASTEVtZNSaN 


ICITEO 


AUTHOR ) 














69 SOitOi-'N fj 


<CtTuO 


AUTHOR > 












9 


69AS0W0EN 


(CITEO 


AUTHOR) 














70 HUMl C 




(WORD) 


(TP 


1 ) 


OA 






7 


7 1 FULVIC 




( WORD ) 


(TP 


1 ) 


DA 






7 






TOTAL 


1 NOW 


IN USE 






392 
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MEDLARS SEARCH REQUEST APPENDIX C 



MKDLARS SI-:A»i:U STATION 
PACIITC SdU I l!»KS'|■ 
HH.l^^^^l NU niCAI, l.lliRAHV S^HVICK 

HUAIK.mCAi, MllHAHY 
IMK CKNi KH roll 1 IIK Hh:Ai;| n scikncks 
rNHT.HSn V OK CAI IKOHMA 

I OS AN(;rTT:s, (:AJ,u()iiNrA^>oo2v 


DATE 

5 Febmary 1973 

Is this your first request lo MrT)LAUS> 
' i: Ives KIno 


I INDIVIDUAL WHO WILL ACTUALLY USE THIS BIBLIOGRAPHY 

Dr. Keith Kvenvolden 


TELEPHONE NO. 

965-5181 


Z TITLE 

Chief, Chemical, Evolution Branch 


3- ORGANIZATION r f>r/»cirf nn ri i /<uf. .hi 6'ranr/i, niv^siun, clc ) 

Life Sciences LibraiT. N239-i;5 
MSA> Ames Research Center 


4. ORGA N 1 Z A T tON <t/'ni v'ef$if>\ Corpor»< t/tiri, Comptiny, vtc ) 

Government Agency 


5 STRPET ADDRESS AND CITY 

Moffett Field, California, 9^035 


6. STATE AND ZIP CODE 

CALIF. 9't035 


7 REQUEST SUBMITTED BY <ll dtHt^rcnt from iihoYe> 

Linda Gufstasen, Ref. Librarian 


TELEPHONE NO. 

965-5387 


8- SEARCH ANALYST ^/.p.ivc /n.in^c; 



ALt OF THE QUESTIONS THAT FOLLOW ARE DESIGNED TO PROVIDE INFORMATION NEEDED TO DEVELOP A BIBLIOGRAPHY 
THAT »S RESPONSIVE TO YOUR NEEDS. YOUR CARE IN PROViDlNC FULL INFORMATION WILL AFFECT THE USEFULNESS OF 
THE CITATIONS THAT YOU WILL RECEIVE. 

DETAILED STATEMENT OF REQUIREMENTS: Plc.^sc describe, in your own words* the subject rn.ittcr for which the search is to be 
toniucicii. [<v as spccitic as possibte. Ocfinc any terms that may have special meaning in your request- Also if there are points 
NO I ro bt irk luJed. please slate these. 



This is a request for an update of previous searches no. IOOI5I and 
no. 10753 v.Mch together covered material published from January 1964 
through Rarch 1972. We would like tp have it updated through the 
present date (February, I973). 

The original request read as follov;s: 

"Vfe desire a comprehensive search of MEDLARS literature on 
chemical evolution and the origin of life in the universe. Terms 
v;hich might clarify in what areas we are interested are: 

Origin of Life 

Chemical evolution 

Pre-Caxnbrian organic chemistry 

Exobiology (life on other planets) 

Spontaneous generation 

Biogenesis 

Abiogenesis" 

V/e v/ould like to add the following terms: 

Prebiotie synthesis (organic compounds) 
, . Extraterrestrial orfi;anic chemistry 



PI \1 M.IM : <Alh)W i weeks) 



iK O 4 {formerly PHS.4667-I) 

(ERIO^) 



45 



ongoing bibliography 



NO RESTRICTIONS 

LJ HUMAN SUBJtCTS 

[□veterinary MEDICINE. « f only ce»a in an ima Is or an icna 1 g.oups arc of inter. SI. pleasMist 1^^^ 



CJAKIWAL EXPERIMENTS: If only cerrain animals or animal groups are of ioterest, please list these: 



f MALE 
' FEMALE 



1 NORMAL STATE 



[ J DISEASED STATE 



L ] CLINICAL RESEARCH ((eating cl dmgs of technics 
in /rumens on \yi 



^ itU VITRO STUDIES 

(<^f <tnimat or htin:.tn ti^sties ur //aicis only) 



[ J ^*SE HISTORIES 



LANGUAOE RESTRICTIONS: 

tX^^^^EPT ALL LANGUAGES 

I ; ACCEPT CERTAIN LANGUAGES ONLY (f^Uase spBCify) 



[ ] ACCEPT ONLY ENGLISH 



AGE CROUPS: If only cea^un ogc groups re of interest . please indicate ^ich ones: 



lo I monrh 

— — l-2 \ months 



— — 6—12 years 

— 13-18 years 

„ 19-rl i years 



15-64 years 
65- years 



GEOGRAPHIC RESTnlCT^ONS: If cnly certain regions are of imercs,t, please Ust these: 



I ^"'-""^^^ PAPERS- Please caro' out a pfeliminary literdlure search of your own bcfocc submitting this request to 

MKm.AK.V au4 supply lull bihlio^r.*phic citations below fcr relevant articles you have found. Wheievet possible, they shoulj be 
Lou, nl."r 1^ i^^^'^^ti''; "T " ^^T"'^' V"^" citations ^,11 he used as a guide iri retrieving similar citations related to 

youi needs. n><^> sv.ll alsi, be used in a later appraisal the results of this search. If no relevant papers have beea found. please 
siatt' none round ' 



Dowler, J., Puller, D. , Orgel, L. E, , and Sanchez, R, A. '^Prebiotic Synthesis 
of Propiolaldtihyde and Nicotinaanide^S Science , I69 (3952): 1320*-lj521, 
Sept,, 25, 1970. 

^' Harads, K. "Chemical Evolution - Chemical Basis for Origin of Life", Protein 
(Tokyo) 15:855-869, July, 1970. (Japanese) 

Ebx, S, W., et al, "Bio-organic Coir.pounds and Glassy Microparticles in Lunar 

Pines and Other Materials^', Science l67(39l8) ; 767-770. January 50, 1970. 

Neuman, , Neumn, W. P., and Lane, K. '^On the Possible Role of Crystals in 

the Origins of Life: IV. The Phosphorylation of l^fucleotides'*, Ciirr. Mod . 
Biol . 5:277-28:5, July, I97O. 



[f you used (NOKX Mf-tXClfS for your ptctiminaty search please list the subject heading;* ufider ^hich you sou^^ht 



ciiatiorvs: 



ABIOGENESIS 
EVOLUTION 

EXTRATERRESTRIAL ENVIRONMENT 
ORIGIN OP LIFE 



UvSEARCH REOUtReWENTS- Plta%t «heck on« of tSe boK«t btlc*^ fo in^tcat* tke t^p» of teofcK rhot you would pr«lti: 

f?!-,^['^i^ search desij^ned to retrieve as many as possible of the relevant citations, but whcch 
mi^hi also retrieve many irrelevant citations. 

Li ^^"^ff^^ 'i'^''^f<-'h desi>;f>ed ID retrieve some oniy of the relevant cit^^ with few 

accompanying irteievant citations, 

MUMBER or CITATIONS EXf>ECTEO: Pltoie thtck th* fippropfio^e bon io inaicot* fh* number of (ournot arlicfes deolirtg with the subi*ef of 
rouf fequ»»t fSof you eoniider to have b«t« publitHt^f iirtc« Januory 1^66. 



r 1 0 



[50 lot « 200 
[^"J Z0< - SOO 



Il4, PRINT ON: 



ISi: H^W did yoy tut* beot ob^ul MEDLARS? 



pf js 1* * )1* paper 
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APPENDIX D 



RECON Search Strategy 



!• Abiogenesis 2h, 

2. Biochemical Evolution Zo» 

3. •Biogeochemistry 26. 
Carbonaceous Ifeteorites 27. 

5. Chondrites 28. 

6. Ext^raterrestrial Life 29- 

7. Origins 50- 

8. Combine 1-7 (Or function) 51- 
^9. Limit to 1972-75 input J^2. 

10. Amino Acids /55- 

11. Animonia 3^. 

12. Blue-green Algafe 

13. Carbohydrates 36. 
Ih. Poraaldehyde 37- 

15. Methane 38. 

16. Nicotinic* Acid 39. 

17. Nucleosides hO. 

18. Nucleotides hi. 

19 . Organic Compounds ^2 . 

20. Photosynthesis ^3* 

21. . Porphyrins . . . , = . A^v 

22. Proteins ^^5- 

23. ; Proteinoids 46* 



Phosphorylation ky . Combine 9 and 46 

(Or function) 

Peptides 

Pyrimidines 48. Print 47 

Purines 
Adenines 
Alanine 
Cysteine 
Lysine 
Methonine 

Combine 10-19 (Or function) 
Combine 20-29 (Or function) 
Combine 33, 34 and 30-32 (Or function) 
Extraterrestrial Enviornment 
Interstellar Matter 
Jupiter Atmosphere 
Mars Atmosphere 
Venus Atmosphere 
Lunar Composition 
Meteorites 
Precambrain Fferiod 
Coinbinevj^^ 

Combine 35 and 44 (And function) < 
Limit 1972-73 input 



